NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Multimodal Transformers for Real-Time Surgical Activity Prediction

https://doi.org/10.1109/ICRA57147.2024.10611048

Weerasinghe, Keshara; Reza_Roodabeh, Seyed Hamid; Hutchinson, Kay; Alemzadeh, Homa (May 2024, IEEE)

Full Text Available
Evaluating the Task Generalization of Temporal Convolutional Networks for Surgical Gesture and Motion Recognition Using Kinematic Data

https://doi.org/10.1109/LRA.2023.3292581

Hutchinson, Kay; Reyes, Ian; Li, Zongyu; Alemzadeh, Homa (July 2023, IEEE Robotics and Automation Letters)

Full Text Available
Towards Surgical Context Inference and Translation to Gestures

https://doi.org/10.1109/ICRA48891.2023.10160383

Hutchinson, Kay; Li, Zongyu; Reyes, Ian; Alemzadeh, Homa (May 2023, IEEE)
COMPASS: a formal framework and aggregate dataset for generalized surgical procedure modeling

https://doi.org/10.1007/s11548-023-02922-1

Hutchinson, Kay; Reyes, Ian; Li, Zongyu; Alemzadeh, Homa (May 2023, International Journal of Computer Assisted Radiology and Surgery)

Purpose: We propose a formal framework for the modeling and segmentation of minimally invasive surgical tasks using a unified set of motion primitives (MPs) to enable more objective labeling and the aggregation of different datasets. Methods: We model dry-lab surgical tasks as finite state machines, representing how the execution of MPs as the basic surgical actions results in the change of surgical context, which characterizes the physical interactions among tools and objects in the surgical environment. We develop methods for labeling surgical context based on video data and for automatic translation of context to MP labels. We then use our framework to create the COntext and Motion Primitive Aggregate Surgical Set (COMPASS), including six dry-lab surgical tasks from three publicly available datasets (JIGSAWS, DESK, and ROSMA), with kinematic and video data and context and MP labels. Results: Our context labeling method achieves near-perfect agreement between consensus labels from crowd-sourcing and expert surgeons. Segmentation of tasks to MPs results in the creation of the COMPASS dataset that nearly triples the amount of data for modeling and analysis and enables the generation of separate transcripts for the left and right tools. Conclusion: The proposed framework results in high quality labeling of surgical data based on context and fine-grained MPs. Modeling surgical tasks with MPs enables the aggregation of different datasets and the separate analysis of left and right hands for bimanual coordination assessment. Our formal framework and aggregate dataset can support the development of explainable and multi-granularity models for improved surgical process analysis, skill assessment, error detection, and autonomy.
more » « less
Full Text Available
Runtime Detection of Executional Errors in Robot-Assisted Surgery

https://doi.org/10.1109/ICRA46639.2022.9812034

Li, Zongyu; Hutchinson, Kay; Alemzadeh, Homa (May 2022, 2022 International Conference on Robotics and Automation (ICRA))
A Reactive Autonomous Camera System for the RAVEN II Surgical Robot

https://doi.org/10.1109/ISMR48331.2020.9312934

Hutchinson, Kay; Yasar, Mohammad Samin; Bhatia, Harshneet; Alemzadeh, Homa (November 2020, 2020 International Symposium on Medical Robotics (ISMR))
null (Ed.)
The endoscopic camera of a surgical robot pro- vides surgeons with a magnified 3D view of the surgical field, but repositioning it increases mental workload and operation time. Poor camera placement contributes to safety-critical events when surgical tools move out of the view of the camera. This paper presents a proof of concept of an autonomous camera system for the Raven II surgical robot that aims to reduce surgeon workload and improve safety by providing an optimal view of the workspace showing all objects of interest. This system uses transfer learning to localize and classify objects of interest within the view of a stereoscopic camera. The positions and centroid of the objects are estimated and a set of control rules determines the movement of the camera towards a more desired view. Our perception module had an accuracy of 61.21% overall for identifying objects of interest and was able to localize both graspers and multiple blocks in the environment. Comparison of the commands proposed by our system with the desired commands from a survey of 13 participants indicates that the autonomous camera system proposes appropriate movements for the tilt and pan of the camera.
more » « less
Full Text Available
Analysis of executional and procedural errors in dry‐lab robotic surgery experiments

https://doi.org/10.1002/rcs.2375

Hutchinson, Kay; Li, Zongyu; Cantrell, Leigh A.; Schenkman, Noah S.; Alemzadeh, Homa (February 2022, The International Journal of Medical Robotics and Computer Assisted Surgery)

Abstract BackgroundAnalysing kinematic and video data can help identify potentially erroneous motions that lead to sub‐optimal surgeon performance and safety‐critical events in robot‐assisted surgery. MethodsWe develop a rubric for identifying task and gesture‐specific executional and procedural errors and evaluate dry‐lab demonstrations of suturing and needle passing tasks from the JIGSAWS dataset. We characterise erroneous parts of demonstrations by labelling video data, and use distribution similarity analysis and trajectory averaging on kinematic data to identify parameters that distinguish erroneous gestures. ResultsExecutional error frequency varies by task and gesture, and correlates with skill level. Some predominant error modes in each gesture are distinguishable by analysing error‐specific kinematic parameters. Procedural errors could lead to lower performance scores and increased demonstration times but also depend on surgical style. ConclusionsThis study provides insights into context‐dependent errors that can be used to design automated error detection mechanisms and improve training and skill assessment.
more » « less

Search for: All records